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(54) Tide: NOVEL ENDOGLUCANASE ENZYME 



(S7>~Abstracl - 

The present invention concerns a novel enzyme, EGV, having endoglucanase activity. The molecular weight of the enzyme is about 
20 to 25 kDA and it is isolated from the fungus Trichoderma reesei. The invention also relates to a DNA sequence coding for (be novel 
enzyme as well as vectors, yeast strains and fungal strains containing the DNA sequmrr Furthermore, the invention concerns a method 
for isolating the DNA sequence coding for the novel enzyme and Cor constructing yeast and fungal strains which are capable of expressing 
eodoglucansse. The invention also provides an enzyme product having endoglucanase activity and methods for enzymatically nxxhfying 
lignoceUulosic materials, in particular for modification or degradation of ceUnlose and/or 0-glucan. 
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xinwi . KNDOqT V™P ENZYME 



° NA T ^ ZLore. the invention concerns a metnoa 

for isolating the DNA sequence coding for the novel enzyme 

and/or 0-glucan. 
porlr gmuiiif «f «hg invention 

-Many fungal'Species-produce enzymes thantegrad* plant polymeis into simple compounds 

cnoiqnandl 4-ft-D-glucosidases (EC 4.3.2.21). ine 
i-^c 4-fi-D-elucanases, EC 3.2.1.91) ana 

CL. «.«.«<• »»■-*— '-• 

1987, Salohcimo et al. 1988). 

« p. 0, • cello*. — . - - <«^^72 £Z°< • 
cellulose substrate is aimed at. it is generally requa™ 

MM- — ""X**™*'- * ,,,e,,a0 ' ,t, ' 
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Trichodenna reesei produces such enzyme mixtures. 

It is also known that the cellulase enzymes belonging to the same class are mutually different 
as regards their activities towards some of the cellulosic substrates. For example, CBrHI will 
5 catalyze hydrolysis of 0-glucan whereas CBHI is inactive. toward that substrate. 

The cellulase enzymes usually consist functionally of two different parts, viz. a core and a 
tail, which are interconnected by an intermittent part (known as the linker). The active centre 
of the enzyme is located in the core. The function of the tail consists mainly of its capability 
10 to attach the enzyme to an insoluble substrate. Thus, if the tail is removed the activity of the 
enzyme toward macromolecular and crystalline substrates can be substantially decreased. 

By way of a general definition, the name "endoglucanases" is assigned to enzymes that 
catalyze random hydrolysis of 0- 1-4 glycosidic bonds between glucose units of cellulose 

15 polymers. The two major Trichoderma endoglucanases, EGI and EGO, contain about 500 to 
600 amino acids and their molecular weights are about 50 to 60 kDa. Also the cellobiohydro- 
lases are similar in size. These kinds or rather bulky molecules may have difficulties in 
_ penetrating -some fibrous substrates whose adjacent polysaccharide chains are aligned and 
located close to each other. Such substrates are represented by fibrous materials of great 

20 economic values, such as cellulose pulp. Therefore, endoglucanases of a low molecular 
weight have been of an increasing interest during the last years. 

Hakansson et al. (1978) have purified a small endogiucanase from culture filtrates of 
7. reesei. This enzyme has a size of about 20 kDa, a neutral pi and, unlike the major 

25 cellulases, it does not contain carbohydrate moieties. Hakansson et al. found the enzyme to 
be present in the culture medium in very small amounts. Small endoglucanases of similar 
properties have also been isolated by Gong et al. (1979) and Olker and Sprey (1990). 
However, although the molecular weight of the endogiucanase isolated by Hakansson and 
partially sequenced by Stahlberg in 1991 is rather low, the molecular configuration of the 

30 enzyme is not advantageous as far as enzymatical applications are concerned. The molecule 
_ appears. noLto contain a linker domain and a cellulose binding domain (CBD) but only a core 
domain. The cellulose binding domain and a linker region, allowing for its flexible separation 
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from the catalytic cor,, are essential features of true cellulases capable of efficient attachment 
to the substrate. 

The PCT Application No. PCT/US91/07276 discloses an endoglucanase enzyme, called 
5 - EGm derived from Tnckodema. The molecular size of the EGm is 23 to 28 kDa. its pH 
optimum is 5.5 to 6.0 and the P I 7.2 to 8.0. From the sequence data of EGIH. it is apparent 
that said enzyme is the same as the one isolated by Hakansson and sequenced by Stihlberg 
and that it does not contain the linker and CBD domains. 
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Known in the art are also small endoglucanase enzymes isolated from other microorganisms. 
Thus, a gene coding for a polypeptide homologous to the short amino acid sequence avauable 
from the protein described above has been isolated from the fungus Aspergillus aculeatus 
(Ooi et al. 1990). The PCT Patent Application No. PCT/DK91/00123 describes an 
endoglucanase derived from the fungus Humicola insolens. The size of the polypepade 
molecule is 43 kDa and its isoelectric point is 5.1. The use of the enzyme for treatment of 
cellulose-containing fabrics is suggested. 

Nothing has so far been reported on the existance of a small size, true Trichoma endoglu- 
canase having cellulose binding regions. It has been a general conception that the ceUulase 
system of Trichodema consists of at least two CBH:s and two EGs and additionally of the 
EGID which lacks a CBD. 

isolation and manipulation of the cellulase genes is very important for the various commer- 
cial uses of enzymes and of the organisms producing them. Isolation of hydrolase genes from 
eukaryotes has been a task demanding either extensive studies on the corresponding enzymes 
or the laborous differential hybridization protocols. 



Summary of fo e invention 

30 It is an object of the present invention to provide a novel endoglucanase enzyme of 
- - molecular weight and having a suitable configuration for enzymatical applications. 
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This invention provides an endoglucanase enzyme derived from Trichoderma reesei which (in 
unglycosylated form) has a molecular weight of about 20 to 25 kDa and contains 242 amino 
acids (the mature protein contains less amino acids that that depending on the signal sequence 
cleavage site), some 70 % of which are located in the core region, whereas roughly one sixth 
5 ~ of the amino acids is in the linker, taking an extended conformation, and one sixth in the 

CBD domain. This distribution of the amino acid residues within the molecule gives evidence 
of it having an elongated, "wormislT form in comparison to other cellulases, which 
facilitates penetration between adjacent molecules of fibrous cellulosic substrates. Being 
different in structure and activity, the enzyme complements the cellulolytic enzyme mixture- 
10 acting in synergy, as the Examples below will show. 

Another object of the invention is to provide a simple and rapid method for isolation of 
endoglucanase genes by function. In fact, the method described in more detail below, makes 
it possible to isolate any hydrolytic enzyme gene, such as genes coding for cellulases (for 
instance endoglucanases and cellobiohydrolases) and hemicellulases (for instance xylanases 
and mannanases), without previous knowledge of the corresponding proteins. In this 
connection it should be pointed out that before this invention there did not exist any data on 
protein level which would have suggested the existence of the novel endoglucanase described 
herein. This fact is already indicative of the unusual properties resulting in its disregard in 
the biochemical characterization of the cellulase mixture produced by Trichoderma. 

According to the present method, an expression cDNA library is made from the organism of 
choice into a yeast expression vector. Yeast transformants are screened on plates containing 
_ the substrate of the desired activity. Using our earlier finding (Penttila et al. 1987, 1988) that 
yeast produces and secretes the major cellulases of T. reesei in active form, the enzymatic 
activities can be visualized on substrate plates. 

In the following description of the present invention, the novel gene coding for the novel 
endoglucanase enzyme is characterized as is its transfer into, and the expression thereof, in 
suitable hosts, such as fungi of the genus Trichoderma, in particular various Trichoderma 
reesei strains, and yeasts, such as Saccharornyces cerevisiae. 
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The invention also provides yeast and fungal strains transformed with the gene coding for the 
novel endoglucanase enzyme. Finally, applications of the enzyme are suggested. 

y r j r f ^ri ption of drawings 

Fig 1 shows the nucleotide sequence of the gene egl5 coding for the novel enzyme, EGV 
Fig 2A shows the cellulose binding domains and Fig. 2B linker regions of EGV compared 
with the same domains and regions of the other Trickoterma cellulases. In Fig 2B, the senne 
and theronine residues have been boxed. 

Fig. 3 shows the endoglucanase gene e g l5 integrated into plasmid P AJ401 resulting m 

plasmid pAS4. 

Fig . 4 shows the endoglucanase gene e g l5 integrated into plasmid P MlX>16del5 resulting » 
plasmid pAS16. 

Fig. 5 shows the structure of plasmid pML016, 
15 Fie 6 shows the structure of plasmid pML016del5(l 1), 

Figs. 7a to 7d depicts the construction of me e g l5 expression plasmid P AIK956. Fig. 7a also 
indicating the structure of plasmid pAS13, 

Fig. 8 indicates the relative activity of the novel endoglucanase enzyme as a function of the 
pH, 

20 Fig. 9 shows the P H stability of the enzyme, and 

Fig. 10 shows the introns and coding sequence of the egl5 gene. 

Pttniln 1 *Trr i v iion of thr tore"*' 0 " 
25 In the following description, the following abbreviations and definitions are used: 

A>hneviations: 

aa amino acid(s); bp, base pair(s); CBD. cellulose-binding domain; CBH. cenobiohyorolase; 
30 OA. gene coding for CBH; CMC. carboxymethy, cellulose; EG. endoglu^nase; ^g- 
coding for EG; HCA, hydrophobic duster analysis; HEC. hydroxyethyl cellulose, tt. Id* 
base(s)- kDa. kilo dal«on(s); MUC. 4-methyl-umbelliferyl B-D-cellobioside; MUL. 4-methyl- 
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umbelliferyl B-D-lactoside; NMR, nuclear magnetic resonance; PCR t polymerase chain 
reaction; PGK> 3-phosphoglycerate kinase gene of Saccharomyces cerevisiae; pi, isoelectric 
point. 

5 Definitions: 

Within the scope of the present invention, the term "cellulase" is used as a collective term 
which encompasses enzymes catalyzing reactions which participate in the degradation of 
insoluble cellulose or cellulosic substrates to soluble carbohydrate. "Cellulase" is known in . 
10 the art to refer to such a group of enzymes. As mentioned above, for hydrolysis of cellulose 
to glucose, three cellulase enzymes (three types of cellulase enzyme activity) are needed: ran- 
domly cleaving endoglucanases (1,4,-0-D-glucan glucanohydrolase, EC 3.2.1.4) which 
usually attack substituted soluble substrates; cellobiohydrolase (1 ,4-0-D-glucan cellobiohydro- 
lase, EC 3.2.1.91) which is capable of degrading crystalline cellulose but has no activity 
15 towards derivatized cellulose and 0-glucosidase (0-D-glucoside glycohydrolase, EC 3.2.1.21) 
which degrades cellobiose and cello-oligosaccharides to yield glucose. Each of the three main 
- types of enzymes listed above occurs in multiple forms. For example, two immunologically 
distinctive cellobiohydrolases, CBHI and CBHII are known. In addition, at least two distinct 
endoglucanases are known. Synergistic action between some of these enzymes has been 
demonstrated. "Cellulase activity" is synonymous with cellulolytic activity. 

Enzymes having "endoglucanase activity" are, within the scope of the present invention, 
enzymes which will catalyse the hydrolysis of internal 0-1,4-linkages of cellulose. 

By "enzyme preparation" is meant a composition containing enzymes which have been 
extracted from (either partially or completely purified from) the microorganisms (for instance 
the fungi) producing them. The term "enzyme preparation" is meant to include a composition 
comprising medium used to culture such microorganisms and any enzymes which the micro- 
organisms have secreted into such medium during the culture. 

"Culture medium" denotes a medium previously used to culture a fungi ("spent" culture 
medium), such culture medium containing enzymes which the fungi have secreted into the 
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^ to i* «*» — > » - — » ■»* " " """" " 

compleiely purified, "mm"*. » 

B, -hvoridtaoon — < "« mn ' ,! ™" °' ™" 

LeL hybridize » to DNA <* * "**""» '"'' ,,,8 

beh$ about 20 to 25 kDa and containing 242 amino ..ids <to — I— ««i *" 

amino acids). 

-Gene" denote, a DNA sequence containing a template for a RNA polymerase. RNA that 
codes for a protein is termed messenger RNA (mRNA). 

lt is well known that mutations occur in genes and can cause changes in the amino acid 

I^es. As used herein, the term e g <5 gene includes aU DNA seances homdogous w«h 
fc science herein disclosed for «* and encoding polypeptides with the fu^onal or 

cellulases Ldng the .inker and CBD regions still exhibit catalytic actmty towards the * 
TZ ^Lge, and - a smaUer core polypeptide is also included in the denote of 
^ Sequences artificially derived from this gene but still encoding a polypepttde wtfh the 
egl5. Sequences anu , nrnnFrt ,cs are also included and encompassed by the expression 
desired factional or structural properties are also mciuuco r 

"functional equivalents". 

A cloning vehicle or a vector is a plasnud or phage DNA or otor DNA ^ 8 

UnearDNA) which provides . W -*> acid * 

^ te _1, d. no. a—* replicate .ad ins^. merely provrfe a vein* ** 

:~=ri=r- sr.- - 



inted from Mimosa 04/26/2000 



WO 94/25117 



PCT/F194/00234 



8 

which DNA may be spliced in order to bring about replication and cloning of such DNA. 
The cloning vehicle may further contain a marker suitable for use in the identification of 
cells transformed with the cloning vehicle. Markers, for example, are tetracycline resistance 
or ampicillin resistance for E. coli and for example phleomycin resistance or acetamidase for 
5 Trichoderma. The word "vector" is sometimes used for "cloning vehicle. w Alternatively, 

such markers may be provided on a cloning vehicle which is separate from that supplying the 
gene of interest. 

A vehicle or vector similar to a cloning vehicle but which is capable of expressing a gene of 
10 interest which has been cloned into it, after transformation into a desired host, is called an 
" expression vector. In a preferred embodiment, such expression vehicle provides for an 
enhanced expression of a gene of interest which has been cloned into it, after transformation 
into a desired host. 

15 The gene of interest which is provided to a fungal host as part of a cloning or expression 
vehicle integrates into the fungal chromosome. Sequences which derive from the cloning 
vehicle or expression vehicle may also be integrated with the gene of interest during the 
integration process. 

20 The gene of interest may preferably be placed under the control of (i.e., operably linked to) 
certain control sequences such as promoter sequences provided by the vector (which integrate 
with the gene of interest). If desired, such control sequences may be provided by the fungal 
host's chromosome as a result of the locus of insertion. 

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a polypeptide if 
it contains expression control sequences which contain transcriptional regulatory information 
and such sequences are "operably linked" to the nucleotide sequence which encodes the 
polypeptide. 
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fr/fpthnd of isn|atinp genes 

Bacterial cellule gene, have widely been isolated by transforming genomic libraries into E. 
coli and screening activities on ceMose^ontaining plates (reviewed by Begutn et al. 1987)^ 
This approach relies on the functionality of promoter sequences from other prokaryotes «£. 
coli and is not applicable to eukaryotes. Furthermore, eukaryotic genes, such as the T.reeset 
EGV described here, contain introns which cannot be excised in E. coli and thus dtsturb the 
reading frame. Moreover, the TricHodema cellulases cannot generally be expressed m E. 
coli in active form even if expressed from cDNA coupled to bacterial expression stgnals. ■ 
Traditionally fungal cellulase genes have been cloned using either differential hybridizauon. 
antibodies raised against the corresponding enzymes or hybridization with oligonucleoude 
pro *s based on the protein sequence of the enzymes (Beguin et al. 1987). All these methods 
are laborous and demand a lot of time and previous knowledge of the corresponding 
enzymes. 

With the method according to the present invention, genes coding for new activities can be 
easily isolated without any previous knowledge of the protein. According to the invention, a 
fungal strain (e.g. TricHodenna) is cultivated on a culture medium which will induce enzyme 
production. Such culture medium typically contains ceUulosic substrate, if endoglucanase 
production is aimed at. After cultivation, the mRNA of the strain is isolated and the 
corresponding cDNA is formed. cDNA made from the organism of interest is cloned mto a 
yeas, vector to construct an expression gene library in yeast, for instance Saccharomyces 
cerevisiae . The genes of the fungus are men expressed under any suitable promoter provdmg 
sufficient expression level, such as the yeast promoter PGK. The enzyme, e.g. 
endoglucanase. is extracellularly secreted and the colonies producing the desired enzymes, 
e .g. the endoglucanase. can be identified on the basis of their production of enzyme actmty . 

Screening can be effected with activity plate assays. Thus, according to one preferred 
cnbodiment of the present invention, the endoglucanase gene is isolated by plating the 
. expression,ibrary onto plates containing barley fl-glucan as substrate. Afte, growth the ceils 
are washed away and the plates are stained with congo red to reveal the hydrolysa ; halos. Up 
to 50 % of the clones giving halos may contain endoglucanase. The genes codmg for 
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different endoglucanases can be identified by analyzing the clones. 

The expression gene library can also be constructed by using some other yeast promoter 
which will provide a weaker level of expression. If it is to be expected that the enzyme is 
5 deleterions to the yeast, the inducible GAL1 promoter would be recommendable. It is also 
possible to use the endoglucanase's own promoter and, for the purpose of isolating the genes, 
a chromosomal gene library can, in some cases, be used. The gene library can also be 
constructed in a single copy plasmid. Also any other yeast strain with established trans- 
formation procedures can be used as a host, because their secretion capabilities are usually . 
10 even higher than that of Saccharomyces. 

In summary, the invention comprises the steps of 

- enriching the rnRNA pool of a fungal strain, e.g. Trichoderma, producing 
endoglucanase activity in respect of the mRNA of the endoglucanase by cultivating the 

15 strain in conditions which will induce the endoglucanase production of said strain, 

- isolating mRNA from the strain, 

- preparing cDNA corresponding to the isolated mRNA, 

- placing the cDNA thus obtained in a vector under the control of a suitable promoter, 

- transforming the recombinant plasmids into a yeast strain which naturally does not 
20 produce significant amounts of the endoglucanase in order to provide an expression 

library, 

- cultivating the yeast clones thus obtained on a cultivation medium in order to express 
the expression library in the yeast, 

- separating the yeast clones producing endoglucanase from the other yeast clones, 
25 - isolating the plasmid-DNA of said separated yeast clones, and, 

- if desired, sequencing the DNA in order to determine the DNA sequence coding for 
the endoglucanase. 
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•H.e gene e gl 5 isolated accotding to the above method was sequenced according to conve. 
lional rnethods. The DNA seance of e g l5 is shown in Figure 1 and also Seated » SEQ 
5 ID NO. 1. 

Th, get. eg H codes for a previously unknown «< HI — — ■ * "* 
^ of which is depicted - SEQ ID NO. 2. M^^. - I-* " - 

1* domains •-• * * «*— "°* ly * ""-'"^f"* _ 

regions comp**, those — . in *. .. — 

being the part of the sequence malted with the letter B, whereas the cenotosebindmg 
domain is marlced with the tetter A. The putative N-glycosylatiaii site is 01aI ^'*^ ^ 

15 Thr-^-Leu^-Uu-Ite-V^AU-Vd^t-Ah.). which is untolmed » Ft*™ 1. 
consists of 225 amino acids and has a calculated molecular weight of 22.799 KTJa. 
The core of lbs endoglucanase is separately depicted in SEQ ID NO. 3. It would a ^^ f ^ 1 ^ 

novel enzyme. Thns, it is concert rh« an er^lue^ pr^t »ay » I.™** 

comprise the polypi of 0. cor. domain only. Bowser. » r^T^m 

25 three pans. 

It is believed that the predicted 17 aa signal peptide indicated in Figure 1 can he substituted 
bv archer suiudtte sign- pepdd. P— * of a differer, tettfh. Such . si** . 

30 -^rhyd^lc^^.^^™*^^''^™ 
weight of the acdve polypepdde may », somewhat ard 0. »«1 endoglucause . 
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therefore referred to as having a molecular weight in unglycosylated form of about 20 to 25 
kDa. 

In its O- and N-glycosylated form the enzyme can be significantly bigger having apparent 
5 molecular weights of 35 kDa or even much higher when produced in the yeast 

Saccharomyces. Furthermore, cellulases frequently undergo drastic proteolytic cleavage 
which removes the CBD (and linker) regions and consequently the size of EGV in fungal 
culture medium can be even about 115 IcDa in unglycosylated form. 

10 - While modelling protein conformations from first principles is not possible, the high 

sequence similarity between fungal CBDs warrants the construction of a homology model 
(Sali et al. r 1990). The feasibility of modelling side chain conformations has been 
demonstrated in similar cases (Blundeli et al, 1988, Heiner et al. 9 1993). Modelling of the 
EGV CBD revealed some interesting differences compared to the known structure of the 

15 CBHI CBD. This wedge-shaped domain seems less sharp in EGV and there are some 
differences in main chain and side chain conformations and in hydrophobic properties in 
areas known to be important for binding of the CBHI CBD onto the cellulose surface or for 
the full activity of the CBHI enzyme against crystalline cellulose. Preliminary binding data 
indicate that the EGV CBD is able to bind to cellulose. 

20 

The protein belongs to a new family K of cellulases together with the endoglucanase B of 
Pseudomonas fluorescens and the endoglucanase V of Humicola insolens as studied by hydro- 
- phobic cluster analysis by Henrissat and Bairoch (1993). This strongly suggests that EGV is 
structurally different from all Trichoderma cellulases characterized so far. Based on this, it 
25 would also appear that there are catalytic differences between the present enzyme and the 
other cellulases. The fact that EGV is a true endoglucanase was confirmed by l H-NMR 
spectroscopy, which showed that the internal 0-1,4-linkages were hydrolysed by EGV when 
barley 0-glucan (a soluble glucose polymer containing 0-1,4- and 0-L3-linkages) was used as 
substrate. 

30 

Thus, for instance, as evidenced by Example 1 1 below, the novel endoglucanase appears to 
work synergetically with the known endoglucanase EGII on hydroxyethyl cellulose. 
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pvprAccmri nf tfre pene eel5_ 

Once the vector or DNA sequence containing the construct(s) is prepared for expression, the 
DNA construct*) is introduced into an appropriate host cell by any of a variety of suitable 
5 means, including transformation as described above. After the introduction of the vector, 
recipient cells are grown in a selective medium, which selects for the growth of transformed 
cells. Expression of the cloned gene sequence*) results in the production of the deseed 
protein, or in the production of a fragment of this protein. This expression can take place » 
a continuous manner in the transformed cells, or in a controlled manner. 
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Expression of the gene can be obtained in any fungus with developed transformation and 

expression methods. . 

Trichoderma is an especially useful and practical host for the synthesis of the enzyme 
preparations of the invention because Trichoderma is capable of secreting protein at large 
amounts, for example, concentrations as much as 40 g/L culture fluid have been reported; 
the homologous Trichoderma cbhl promoter provides a very convenient promoter for 
expression of genes-of-interest because it is a strong, single copy promoter which normally 
directs the synthesis of up to 60 % of the secreted protein from the Trichoderma host; the 
transformation system is highly versatile and can be adapted for any gene of interest; the 
Trichoderma host provides an "animal cell type" high mannose glycosylate pattern; and 
culture of Trichoderma is supported by previous extensive experience in industrial scale 
fermentation techniques. In addition, several promoters active on glucose medium can be 
used, which enable the production of the enzyme essentially free from other ceUulases. 

Expression of the protein in the Trichoderma hosts requires the use of regulatory regions 
functional in such hosts. A wide variety of transcriptional and transnational regulatory 
sequences can be employed, since Trichoderma generally recognize eukaryotic host 
transcriptional controls, such as. for example, those of other fuamentous fungi. Such control 
regions may or may not provide an initiator methionine (AUG) codon. depending on whether 
the cloned sequence contains such a methionine. Such regions will, in general, include a 
promoter region sufficient to direct the initiation of RNA synthesis in the host cell. 
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According to the invention the DNA sequence encoding EGV can be transformed into 
Trichoderma and expressed, for example under the strong cbhl promoter, as described in 
EP-A 244,234 and US 5,298,405, or other promoter functional in Trichoderma. The DNA 
sequence coding for EGV can be integrated into the general expression vector pAHMHO. 
5 The transformation can be done as a cotransfonnation using two circular plasmids, the 

selection marker being located in one of the plasmids and the DNA sequence encoding eglS 
in the other, or the selection marker and the DNA sequence encoding the eglS can be located 
in the same plasmid, or linear fragments can be used in the transformation. Possible selection 
markers are, for instance, trpC or argB from Aspergillus nididans or argB or pyr4 from 7*.. 

10 reesei or amdS from A. nidulans or trpl from Neurospora crassa or phleomycine or hygro- 
mycine resistance markers from bacterial origin (EP-A 244,234, US 5,298,405, and EP-B 
539,395 and Ulhoa et a]., 1992, Transformation of Trichoderma species with dominant 
selectable markers, Curr. Genet 21:23-26) or other selection marker shown to function in 
Trichoderma in future (Karhunen et al. 1993, High frequency one-step gene replacement in 

15 Trichoderma reesei I, Endoglucanase I overproduction, MGG, 241: 515-522, and Suominen 
et ah, 1993, High frequency one-step gene replacement in Trichoderma reesei II, Effects of 
deletions of individual cellulase genes. MGG, 241: 523-530. 

To construct a Trichoderma strain producing endoglucanase V as the main cellulolytic enzy- 
20 me it is possible to construct Trichoderma strains that do not produce the endoglucanases I 
and II or all other cellulolytic enzymes: endoglucanase I and II and cellobiohydrolase I and 
II. The desired cellulolytic genes can be made deficient (EP-A 244,234, US 5,298,405, 
Karhunen et al. (1993) and Suominen et al. 1993). If genes are expressed under the cbhl 
promoter the expression is repressed by glucose and thus the strains must be grown on 
25 cellulose-containing medium. 

Alternatively, it is possible to construct Trichoderma strains expressing EGV under glucose 
promoter. This means that the Trichoderma strains expressing EGV can be grown on glucose 
containing medium. Possible glucose promoters are, for example, glucose derepressed cbhl 
30 " promoter of the piasmid pML016del5(ll) {et al. , 1992) and the promoter of the cDNAl 
gene (Nakari et al., 1992) or other glucose promoters. 
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According to .he invention, there is also provided a method for producing in fungal and yeast 
hosts, such as the yeast Sacckaron^ces and filamentous fungi, such as TricHodernv, an 
enzyme preparation having an endoglucanase activity stemming from an endoglucanase 
enzyme, the molecular weight of which (in unglycosylated form) is 20 to 25 kDa. 

Further, if desired activities are present in more than one recombinant host, such 
preparations can be isolated from the appropriate hosts and combined prior to use m the 

method of the invention. 

10 pn-rytnp preparations 

To obtain the enzyme preparations of the invention, containing elevated 
fce recombinant hosts described above haying the desired properties (that ts, hosts cap ble of 
expressing the novel endoglucanase enzyme) are cultivated under suitable cond.uons (cf. 
15 above) . the desired enzymes are secreted from the host into the culture medtum. and the 
enzyme preparation is recovered from said culture medium by methods lenown m the an. 

As mentioned above, the enzyme preparation can be produced by cultivating the ^1™° 
in conditions where the regulatory regions directing endoglucanase express.cn are operatmg. 

• • —A^m if the veast PGK or Trichoderma glucose promoters 

« «d Tto if V is — »*— P~-». *■ 

ntonttoH* promoter i. ~>. * »" ,vaMn ,, " a ™ 7' 

ap » (sm *o, SB 2023, Schill & Mb*. ™»- ™°*™° 
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strains are sensitive to glucose repression and require an inducer (cellulose, lactose or 
sophorose). The pH should preferably be kept at approximately pH 5 to 6 by the addition of 
phosphoric acid or ammonia and the temperature at 30 °C during the cultivation. 

5 The enzyme preparation is recovered from the culture medium by using methods well known 
in the art. However, the enzyme preparations of the invention may be utilized directly from 
the culture medium with no further purification. If desired, such preparations may be lyophi- 
lized, immobilized or the enzymatic activity otherwise concentrated and/or stabilized for 
storage. ^ 

10 - 

If desired, the expressed endoglucanase protein may be further purified in accordance with 
conventional conditions, such as extraction, precipitation, chromatography, affinity chromato- 
graphy, electrophoresis, or the like. 

15 Applications of the novel enzvme 

The catalytic core of the novel enzyme is the smallest of fungal or bacterial cellulases 
. characterized. Therefore the enzyme and the enzyme preparations according to the invention 
have application in the treatment of pulp and paper and in the textile industry. Furthermore, 
20 the enzyme can be used in the fodder industry. The properties of the novel endoglucanase are 
unexpected for a endoglucanase on basis of general knowledge. 

Being a 0-glucanase, the novel enzyme can be used for hydrolyzation of the £-glucan of 
barley. As a result, the viscosity of the fodder is lowered and the nutritional value of the 
25 fodder is improved. 

As evidenced in Example 8, the pH optimum of the enzyme is higher than those of the other 
endoglucanases produced by strains of the species Trichoderma. This favorable pH range can 
be utilized in many ways. One preferred application is for removing colour from denim 
30 jeans; in acidic pH, reabsorption of the colour occurs, but at neutral pH there is much less 
reabsorption. Another preferred embodiment comprises deinking. Normally, the pH of a 
slurry of water and newsprint is about 5.5 to 6.0 and therefore the novel enzyme can be used 
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without any need for adjustment of the pH. On the other hand, coated paper contains fillers 
and pigments which will raise the pH of an aqueous paper slurry formed therefrom. If the 
P H of the slurry is lowered by adding mineral acid, at least some of the suspended or 
dissolved fillers and pigments may precipitate, e.g. in the form of calcium sulphate. 

5 The small size and the advantageous pH range of the novel enzyme make it possible to use it 
for treating recycled fibre in order to improve the technical properties thereof. The enzyme is 
also applicable for improving pulp drainage. 

The invention is described in more detail with the aid of the following non-limiting 
examples. 



15 



In the examples, the following strains and vectors were employed: E. coU strains PLK-F', 
pBluescript SK". and XL-l-Blue (Stratagene) were used as hosts for plasmids and PLK-F' as 
a host for the cDNA library. The following plasmids were used: pASll, P AS13, P ALK487 
and PALK183. The T. reesei strain QM9414 was used as a source of RNA for cDNA 
preparation and Northern analysis. T. reesei ALK02221 and ALK03524 were used as hosts 
for EGV expression. S. cerevisiae strain DBY746 (a hM 1 lsul-3 10*112 unfi-52 SbI-289 
cytf cj£*) was used as a host for the expression library. Strain MD4(Mc (« urjfi fl* ku2-3 
Ieu2-112 Wsi-11 hM-15) was used as a host for the plasmids pMP311, P MS3. pMPU and 
pMP29 carrying egll, egl2, cbhl and cbhl genes of T. reesei, respectively (Penttili et al. 
- 1987. 1988). The yeast expression vector P FL60 (Minet and Lacroute 1990) containing the 
constitutive yeast PGK promoter and terminator. URA3 marker gene and the 2 micron 
plasmid replication origin was kindly provided by Dr. M. Minet. Centre de Genetique 
25 Moleculaire. C.N.R.S.. France. 



20 
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Example 1 

Isolation of endoglucanase gene by expression in yeast and bydrolytic properties of the 
yeast 

5 T. reesei strain QM9414 was cultivated in a 10 liter fennentor at 28 °C and pH 4.0 for 42 
hours. The cultivation medium used to induce bydrolytic enzyme production contained 2 % 
Solka floe cellulose, 1 % distiller's spent grain, 0.2 % Locust bean gum -galactomannan 
(Serva), 0.5 % KH 2 P0 4 and 0.5 % (NH^SCV After 42 hours of growth, lactose (Sigma), 
Birke 150 acetylglucuronoxylan and Oat spelt arabinoxylan were added in an amount of 0.1 * 
10 % each and the cultivation was continued for further 24 hours. 

Total RNA from the T. reesei strain was isolated as described by Chirgwin et al. (1979), and 
the poly(A) + fraction was separated by chromatography through oligo(dT)-cellulose (BRL). 
cDNA, synthesized by the ZAP-cDNA synthesis kit (Stratagene), was ligated to the EcoW- 
15 - Xhol cut plasmid pAJ401. Plasmid pAJ401 was derived from plasmid pFL60 (Minet and 
Lacroute 1990) by changing the two cloning sites EcoRl and Xhol between the yeast PGK 
promoter and terminator into the reverse orientation. Transformation of £. colt strain PLK-F' 
by electroporation (Bio-Rad) according to the manufacturer's instructions yielded a library of 
3.5 x 10* independent clones. Plasmids were isolated from the pool of £. coli transfonnants 
20 and transformed into 5. cerevisiae strain DBY746 by electroporation (Bio-Rad) according to 
the manufacturer's instructions. Electroporation with 7 pg of plasmid DNA yielded a library 
of 8 x 10* yeast transfonnants. 

1.2 x 10 5 yeast cells were plated on barley fl-glucan^ontaining plates to a density of 2000 
25 colonies / 85 mm plate and grown at 30 °C for 3 days. Colonies were replicated and the 
original plates stained with Congo Red. Unstained areas around yeast colonies indicate 
hydrolysis of the substrate to oligosaccharides. Colonies showing activity were picked up 
from the replica plates and purified on new activity plates. Plasmids were recovered from the 
purified clones and analysed by restriction enzyme digestions. 20 clones gave a similar 
30 pattern of bands which was clearly different from the earlier isolated cellulase genes of T. 
reesei. 



Printed from Mimosa 04/26/2000 



PCT/FI94/00234 

WO 94/28117 

19 

Transformation of the plasmids back to yeast confirmed that the activities were caused by the 
cDNA inserts. One of these plasmids, pAS4 (cf. Figure 3). was studied further. The insert in 
the pAS4 plasmid was named egl5 and the corresponding protein EGV. 

5 egl5 cDNA was sequenced from both strands of the original P AS4 plasmid using the Sanger 
dideoxynucleotide method, T7 DNA polymerase (Pharmacia) and sequence specific primers. 

The sequence obtained is shown in SEQ ID NO. 1. 



10 



15 



20 



25 



30 



The chromosomal egl5 gene was isolated from a T. reesei cosmic library (Mantyla, A. et al. 
Curr. Genet. 1992, 21 471-477) by using the eglS cDNA as a probe. About 6 kb Hind®. 
fragment was subcloned to pBluescript SK, resulting in plasmid P AS13 (Fig. 7a). The 
toons and coding sequence of egl5 gene are shown in Figure 10 (SEQ ID NO. 11). 

The activities of the yeast strain DBY746 carrying the pAS4 plasmid were studied by plate 
assays and they were compared with the activities of the yeast strains producing CBHI. 
CBHII, EGI and EGII. 

Hydrolytic activities produced by recombinant yeast cells were detected on SD plates con- 
taining 0.1 % barley 0-glucan (0-D-l,3-l,4-glucan, viscosity 20-30 c.s.; Biocon, UK, 
Sherman 1991) or hydroxycthyl cellulose (HEC, Fluka, Switzerland, product 54290). After 
growth the plates were stained with Congo Red (Merck) as described by Pentrila et al. 
(1987) to reveal the hydrolysis halos. Xylanase activity plates containing 0.2 % of a Remazol 
Brilliant Blue-dyed derivative of xylan (RBB-xylan. Sigma) needed no further treatment. 
Activities against synthetic substrates. 4-methylumbelliferyl 0-D^ellobioside (MUC; Koch- 
light. UK) or 4-methylumbelliferyl /3-D-lactosidc (MUL; Umbda Probes & Diagnostics. 
Austria) were detected as described by Penttili et al. (1987). 

The EGV protein showed a clear activity against /3-glucan but the activity was lower than the 
activities of the strains producing EGI. CBffll or EGII (Table). However, the expression 
,evels and the secretion efficiencies of foreign proteins in yeast may vary and thus u ,s not 
possible to draw any definite conclusions concerning the level of enzyme activity. Also, the 
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pH on the plates is not optimal for EGV function. EGV shows some activity against 
hydroxyethyl cellulose (HEC) in plate assays. No activity was detected on plate assays 
towards RBB-xylan or the small synthetic substrates, methylumbelliferyl cellobioside (MUG) 
or methylumbelliferyl lactoside (MUL). 

Table 1 Hydrolytic activities of the yeast strains carrying the cellulase genes of 

Trichoderma reesei. The extent of hydrolysis of the substrate was estimated 
visually and is indicated by + . 





EGV 


EGI 


EGE 


CBHI 


CBHn 


0-glucan 


+ + 


+ + + + + 


+ + + 




+ +++ 


HEC 




+ + + + 








MUL 




+ + + 




+ + 




MUC 




+ + + 








RBB-xylan 




+ + + 









Example 2 

Construction of endoglucanase expression vectors with truncated fragments of the chbl- 
promoter 

25 The vector pML016 (Figure 5) contains a 2.3 kb cbhl promoter fragment (SEQ ID 4) 

starting at 5' end from the EcdRl site, isolated from chromosomal gene bank of Trichoderma 
reesei (Teeri et al, 1983), a 3.1 kb BamHl fragment of the lacL gene from plasmid pAN924- 
21 (van Gorcorn et al., 1985) and a 1.6 kb cbhl tenninator (SEQ ID 5) starting from 84 bp 
upstream from the translation stop codon and extending to a BamHl site at the 3' end (Shoe- 

30 maker et al. 1983; Teeri et al., 1983). These pieces were linked to a 2.3 kb long EcoRI- 
Pvull region of pBR322 (Sutcliffe, J.G., 1979) generating junctions as shown in Figure 5. 
The exact in frame joint between the 2.3 kb cbhl promoter and the 3.1 kb lacZ gene was 
constructed by using an oligo depicted in Figure 5. A polylinker shown in Figure 5 was 
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cloned into the single internal Xbal site in the chbl promoter for the purpose of promoter 
deletions. A short &zfl linker shown in Figure 5 was cloned into the joint between the 
P BR322 and cbhl promoter fragments so that the expression cassette can be released from 
the vector by restriction digestion with &fl and Sphl. Progressive unidirectional deletions 
5 were introduced to the cbhl promoter by cutting the vector with Kpnl and Xhol and using die 
Erase-A-Base System (Promega. Madison. USA) accordip to manufacturer's instructions. 
Plasmids obtained from different deletion time points were transformed into the E. coli strain 
DH5« (BRL) by the method described in (Hanahan D, 1983) and the deletion end points 
were sequenced by using standard methods. 

Example 3 

Construction of vectors for expression of EGV in Trichoderma in glucos^ontaining 
medium 

15 In order to produce EGV protein in Trichoderma reesei QM9414 strain essentially free of 
other cellulases in a medium containing glucose, the plasmid P AS16 (Fig. 4) was construe 
«d. mere, the eglS cDNA was cloned under the truncated, glucose derepressed cbhl promo- 
ter of the plasmid P ML016del5(ll), generated as explained in Example 2. The plasmid 
contained a 11 10 bp deletion in the cbhl promoter beginning from the promoter internal 

20 polylinker and ending 385 bp before the translation initiation site (Fig. 5). The sequence of 
mis truncated promoter is provided as SEQ ID NO. 6. Plasmid P ML016del5(ll) was diges- 
ted with the restriction enzymes Kspl and Smal. The vector part containing the glucose- 
derepresscd cbhl promoter, the cbhl terminator and tin, P BR322 sequence was blunt-eoded 
with the Mung bean nuclease, dephosphorylated with Calf intestto alkaline phosphatase and 

25 ligated to the egl5 cDNA fragment. 

The yeast expression plasmid P AS4 was digested with EcoBl and partially with Xhol to 
isolate the full-length egl5 cDNA. The ends of the cDNA were fuled-in with the Klenow 
polymerase enzyme and the fragment was ligated into the SmaKleaved vector pSP73 (Pro- 
30 mega). The resulting plasmid pASll was digested with EcoK and Xbal. ftiled-in with the 
Klenow polymerase and ligated to the vector part of the expression vector P ML016del5(ll). 
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Twenty micrograms of the pAS16 plasmid were digested with EcoKL and SphL, phenol- 
extracted, precipitated and transformed into Trichoderma reesei QM9414 together with three 
micrograms of the plasmid p3SR2 (Hyncs et al t 1983) containing the acetamidase gene 
according to Penttila et ai> (1987). 

5 

The promoter of the cDNAl gene (Nakari et aL, 1992) was also used to direct the synthesis 
of the EGV protein on glucose-containing medium. 

The promoter of the cDNAl gene was cloned from the chromosomal DNA by PGR using the 
10 5'primer GGT CTG AAG GAC GTG gaa TGA TGG (SEQ ID NO. 7) and the 3'primer gat gca 

TCG ATC GT C CGC GG G TTG AGA GAA GTT GTT GGA TTG ATC AAA AAG (SEQ ID NO. 8). The 

underlined ATCGAT in the 3'primer is a CM site and the CCGCGG a Kspl site. 

The eglS cDNA and the cbhl terminator were cloned as one fragment from the plasmid 
15 pAS16 by PCR using the 5 'primer gag aga ccg cgg tga tct tcc atc tcg tgt err gct tgt 
a ac (SEQ ID NO. 9) and the 3 'primer atc gtg gat cca tta tta aca ctt cgg tgg (SEQ ID 
NO. 10). The underlined CCGCGG in the 5 'primer is a Kspl site. 

Eight micrograms of both of the fragments were digested with the Kspl enzyme, purified from 
20 agarose gel and ligated. The ligation mixture was extracted with phenol, precipitated and used 
- instead of a plasmid in the Trichoderma transformation together with three micrograms of the 
p3SR2 plasmid. 

The Amd* transformants from the pAS16 transformation were streaked twice onto plates 
25 containing acetamide (Penttila et aL, 1987), and then cultivated on Potato Dextrose Agar 

plates (Difco) from which spore suspensions were made. EGV production was tested from 50 
ml shake flask cultures carried out in irunimal medium according to Penttila et al., (1987) 
except that the amount of glucose was 4 %, KH 3 P0 4 3 %, K 2 ?0 4 0.8 %, (NH 4 ) 2 S0 4 0.2 % 
and the medium was supplemented with 0.2 % peptone. Glucose was added as 15 % solution 
30 when necessary to keep the level above 1 % during the whole four days of the cultivation. 
The culture supernatants of 55 transformants were analyzed for activity against barley p- 
glucan by the DNS-method (Zurbriggen et aL, 1990). 
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The ^rc suspensions of tne three best EGV-producing clones (numbers 101, 79 and 19) 
were purified to single spore cultures on Potato Dextrose Agar plates. EGV production was 
analyzed again from these purified clones as described above. The best producing transfor- 
mant 101c was analysed by Southern blotting using conventional methods and the presence of 
the expression casette in the genomic DNA was confirmed Northern analysis showed that the 
eg 15 gene was expressed from the constructs on glucose medium. 



Example 4 

10 Construction of EGV expression plasmid pALK956 

The expression plasmid pALK956 (Figs. 7d) contains: 

1) T. reesei eglS gene fused to the cbhl promoter. A fragment containing the cbhl terminator 
15 was included after eg!5 to ensure stop in the transcription. 

2) E. coli hph (hygromycin B phosphotransferase; Gritz and Davies, 1983) as a marker gene 
for transformation. The gene was expressed from the T. reesei pU (pyruvate kinase; Scbin- 
dler et aL, 1993) promoter. 

3) Elongated cbhl tenninator as a flanking region to ensure stop mpkl transcription and to 
20 target the expression cassette, together with the cbhl promoter fragment, to the cbhl locus. 

The construction of P ALK956 is shown in detail in Figs. 7a - 7d. For the construction, the 
plasmids pASl 1, pAS13, pALK487 and P ALK183 were used. The plasmid pASll contains 
the eglS cDNA (Fig. 1) and pAS13 contains the chromosomal eglS gene (Fig. 10). The 

25 plasmid pALK487 contains the T. reesei cbhl promoter (the 2.2 kb Stul - SacU fragment 

originally from the plasmid pAMHUO; Nevalainen et aL, 1991) and cbhl temunator (the 0.7 
kb AvaU fragment starting 1 13 bp before the stop codon of the cbhl gene; for the cbhl 
- sequence, see Shoemaker et al., 1983). The plasmid pALK183 contains hph gene under the 
control of the pki promoter. It was constructed from P RLM„30 (Mach et aL, 1994) by 

30 changing the cbh2 terminator to 1.6 kb cbhl elongated terminator (AvaU - BanM fragment). 

The exact fusion of the eglS gene to the cbhl promoter was done by PCR. The 5'-primer 
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contained the last 26 nucleotides of the cbhl promoter including the Sacll site and the first 18 
nucleotides of the coding sequence of eglS (5 '-CAATAGTCAACQQCfifiACTGCGCATCAT- 
GAAGGCAACTCTGGTT; the Sacll site is underlined, eglS sequence is bolded). The 3'- 
primer contained 21 nucleotides of egl5 sequence, including the BamHl site about 0.7 kb from 
5 the beginning of egl5 sequence (5* -GGGCGTGGGATCCGTCTCTTG; the BarnM site is 
underlined). The plasmid pAS13 was used as a template in the PCR reaction. 

The 0.7 kb PCR fragment (filled in with DNA polymerase I Klenow fragment and cut with 
Bamffi), containing the exact link between the cbhl promoter and the eglS gene, was ligated 

10 to Pvull - BamK digested pASl 1 to obtain pALK951. The fusion and the PCR fragment 
were sequenced to ensure that no mistakes had occurred in the PCR amplification. Plasmid 
pALK955, containing the fusion of the egl5 to the cbhl promoter, was obtained by ligating 
£coRI/Klenow - Sacll fragment from pALK951 between the cbhl promoter and terminator in 
the plasmid pALK487 (ZtomHI/Klenow - SkcII). The hph marker gene (under the control of 

15 the pki promoter) and the cbhl 3 '-flanking region (elongated terminator) were ligated to Stul 
cut pALK955 from pALK952 (Xhol - Hindlll fragment / Klenow) to construct pALK956. 

The plasmid pALK952 was constructed from pALK183 by shortening the pki promoter 
compared to the promoter used in pALK183 and pRLM^O (//ofl/partial - Xhol, Klenow). 

20 

The 7.4 kb expression cassette from pALK956 can be removed with Notl digestion. 

Thus, in summary, in the expression plasmid pALK956, the eglS gene is fused to the cbhl 
promoter. The E. coli hph (hygromycin B phosphotransferase) gene is used as a marker for 
25 the transformations. The cbhl 3 '-flanking region (elongated terminator) is included to ensure 
stop in the pki transcription and to target the expression cassette, together with the promoter 
_ fragment, to the cbhl locus. 

Example 5 

30 Expression of EGV under the cbhl promoter in cellulase-inducing medium 

The EGV expression plasmid, pALK956, was digested with Notl, and the 7.4 kb fragment 
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was purified from agarose gel. 2-3 „ of the linear fragment .as transformed tnto T rees, 
strains ALK02221 and ALK03524 according to Penttil* et al. (1987) with the morons 
described in Karhunen et a!. (1993). ALK02221 is a low protease mutant from T. reese, 
VTT-D-79125 (Bailey and Nevalainen, 1981), prepared in our laboratory (A. Mantyla). 

. , • j ft-rtm VTT-D-79125 where the cbh2. egl2 and egll genes have 
ALK03524 is a strain derived from VTT-D-m^, wnc e 

been deleted using the A. Nutans trpC (Yelton et al., 1984), A. nUMans amtS (Kelly and 
Hynes 1985) and Strep.oalloreickus Hindustan* P Hle<f (Mattem et al., 1987) marker genes, 
respectively. The method of one-step gene replacement with a linear fragment and flankmg 
regions of the corresponding cellulose locus is described in Suominen et al. (1993). 

HygB + transforms were selected on plates containing T. reesei minimal medium (Penttila 
et al 1987) with 100 ^ hygromycin/ml. Transforms were purified by smgle spore 
selection on selective medium and then cultivated on Potato Dextrose Agar. Punfied 
transforms were grown in shake flasks in a medium containing 4 % whey, 1.5 * complex 
ni^gen source derived from grain, 1.5 « KH jP 0 4 and 0.5 % (NW Cultures were 
^ntained at 30 -C and 250 rpm for 7 day, The culture supematants were ana^ed for 
activity against barley p-glucan at P H 6.3 by the DNS-method (Zurbriggen et al., 1990). 
Soluble protein was assayed by the method of Lowry et al. (1951) using bovine serum 
albumin as standard. The detection of the 67 kDa CBHI protein was done in SDS-PAGE 
followed by Coomassie Brilliant Blue staining. The results from the best EGV transformants 
and the corresponding host strains are shown in Table 2. In the EGV-transformants the 
p-glucanase activity measured at the optimum P H of EGV was enhanced about twofold. 
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Expression of EGV under the cbhl promoter in cellulase-inducing medium 



5 Strain Protein p-glucanase CBHI 

(mg/ml) activity (BU/ml) (+/-) 



15 



20 



ALK02221 


7.1 


1945 




EGV/ALK02221/11 


5.2 


3262 




EGV/ALK02221/47 


5.5 


3236 




EGV/ALK02221/31 


3.5 


3757 




EGV/ALK02221/68 


3.9 


3479 


+ 


ALK03524 


9.3 


3338 


+ 


EGV/ALK03524/27 


5.3 


6770 


+ 


EGV/ALK03524/28 


7.6 


7588 


+ 


EGV/ALK03524/31 


7.0 


6622 


+ 



Example 6. 

Enzyme preparation containing EGV protein obtained from yeast where eglS gene was 
25 expressed. 



Saccharomyces cerevisiae DBY 746 containing the pAS4 plasmid was grown in a bioreactor 
(Chemap LF 20, working volume 16 1) on a standard YPD medium. The inoculum (5 times 
30 200 ml) was grown in shake flasks in selective synthetic complete medium without uracil. 

Cultivation conditions were: temperature 30 °C, pH controlled between 5.2 and 5.9, aeration 
about 15 1 min' 1 and cultivation time 45 h. The yeast cells were separated from the medium 
by centrifugation and the culture supernatant was concentrated 4-fold by ultrafiltration (PCI 
ES 625 membranes). 

The enzymatic activity in the concentrate was assayed by standard methods using appropriate 
incubation times for the enzyme reaction against p-glucan (Zurbriggen et al. t 1990a) and 
hydroxyethyl cellulose, HEC (IUPAC, 1987). The p-glucanase activity was 0.7 nkat ml* 1 and 
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endoglucanase (HEC) activity less than 0.4 nkat ml '. No endoglucanase activity could be 
detected in culture filtrates of control cultivations of yeast missing the EG V gene (£ 
cerevisiae DBY 746 carrying the plasmid pAJ 401). 

The purified EGV preparation can be obtained from the ultrafiltration concentrate by standard 
protein chromatography method, The EGV protein can be bound to an anion exchanger resm 
(e g. Mono Q columns or DEAE Sepharose FF, Pharmacia) in low ionic strength buffer and 
at appropriate P H. The protein can be eluted out of the column using increasing gradient of 
NaCl (e.g. from 0 to 0.5 M in the buffer of binding). Alternatively, the impurities from the • 
preparation of EGV can be removed by binding them in anion exchange resin at appropriate 
pH and ionic strength where EGV is not bound to the resin. Cation exchanger resms (e.g. 
Mono S columns or CM Sepharose FF, Pharmacia) can be used in analogous way by selecting 
buffers of appropriately low P H (e.g. pH 4 - pH 6). EGV can also be purified by gel 
permeation chromatography where it can be separated due to its small molecular size. The 
columns of various materials (e.g. Sephacryl S-100 HR or various types of Sepharose and 
Superose, Pharmacia; Fractogel TSK HW-55, Merck) in e.g. phosphate or acetate buffers 
containing e.g. 0.05 - 0.5 M NaCl can be used. Hydrophobic interaction chromatography and 
various affinity chromatography methods may also be used. 



20 



25 



30 



Example 7 

Enzyme preparation containing EG V protein obtained from TricHcHerma reesd grown 



on glucose. 



One of the best producing T. reesei QM9414 transforms (number QM/lOlc) was grown in 
a bioreactor (Chemap LF 20, working volume 1 6 1) on a medium of Mandels and Weber 
(1969) where Solka floe cellulose (10 g l ') was replaced by 20 g 1" of glucose and where 
the concentrations of other nutrients were correspondively doubled. The inoculum (5 times 
200 ml) was grown in shake flasks in a medium containing 40 g 1" glucose and the adequate 
mineral salts for nutrients and buffering of the medium. Cultivation conditions were: tempera- 
ture 29 'C, P H controlled between 4.0 and 5.0, aeration about 15 1 min" and cultivation time 
93 h During the cultivation glucose concentration in the fermentor was maintained above 5 g 



printed from Mimosa 04/26/2000 



WO 94/23117 



PCT/FT94/00234 



28 

1"' by adding continuously sterile glucose (40 g I" 1 ) solution. The mycelium was separated 
from the medium by centrifugation and the culture supernatant was concentrated 1 .6 times by 
ultrafiltration (PCI ES 625 membranes). 

5 The clarified supernatant was first fractionated by hydrophobic interaction chromatography. 

The pH of the sample was was adjusted to pH 6.0 and conductivity of the sample to the value 
corresponding to 10 mM sodium phosphate buffer, pH 6.0, containing 1.25 mol 1"' (NH 4 ) 2 S- 
0 4 . The sample was applied to a column (113x110 mm) of Phenyl Sepharose FF (Pharma- 
cia), previously equilibrated with 10 mM sodium phosphate buffer, pH 6.0, containing 1.25 • 
10 mol I* 1 (NHJjSCV Elution was started by the equilibrating buffer followed by a linear dec- 
reasing gradient of ammonium sulphate from 1.25 M to 0 M. Fractions (each 450 ml) which 
contained the major endoglucanase activity were combined, eluted at the end of the decreasing 
gradient and by 10 mM phosphate buffer. The other adsorbed proteins were eluted by distilled 
water and the column was washed with 6 M urea. 

15 

The enzyme preparation obtained in the first chromatographic step was equilibrated to 4 mM 
sodium phosphate, pH 7.2 by gel filtration (Sephadex G-25 coarse). The equilibrated protein 
solution was applied to a column (1 13 x 190 mm) of DEAE Sepharose FF (Pharmacia), pre- 
equilibrated with the same buffer. Elution was performed first with the equilibrating buffer to 
20 remove unadsorbed proteins and thereafter by stepwise additions of sodium chloride to con- 
centration of 200 mM. Fractions (each 900 ml) which contained the endoglucanase and 
which eluted by 200 mM NaCI were collected and the fraction with the highest activity was 
concentrated by ultrafiltration (Amicon PM-10 membranes). The specific activity of the 
preparate was 360 nkat mg" 1 protein and purification factor of ca. 30 was obtained when 
25 compared to the supernatant of the fermentor liquid. 

The preparate was further characterized by isoelectric focusing on PBE-94 anion exchange 
material (Pharmacia). The column was equilibrated by 25 mM imidazole-HCl buffer, pH 7.4 
and elution was carried out by Polybuffer 74 (Pharmacia) - HCI buffer, pH 4.0 according to 
30 the manufacturer's instructions. EGV, measured by P-glucanase activity, eluted from the 
column at pH 6.6 - 7.2. 
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Example 8 

pH-Optimum of EGV 



,.0 % solutions of barley P-glucan (Megazyme, Australia) was prepared in Mdlvaine buffers 
diluted one to four (corresponding ca. 50 mM citrate-phosphate buffer), in P H range from 3.0 
to 8 0 Activity of an enzyme sample prepared as described in Example 7 was assayed ustng 
as substrate the P -glucan solutions prepared in the varying pH-values. The assay procedure 
was otherwise similar to the procedure of endoglucanase assay (JUPAC. 1987). Incubation 
time in the assay was 10 min at 50 -C. after which the enzyme reaction was terminated by . 
boiling. Reducing sugar groups formed in the reaction were measured by DNS-reacUon. 



The pH-optimum of EGV was 6.0 - 6.5 (Figure 8). 



15 Example 9 

pH-Stability of EGV 



20 



25 



An EGV sample was prepared as described in Example 7, except that the last concentration 
by ultrafiltration was omitted (activity 48 nkat/ml, assayed at pH 6.3 against barley P-glucan, 
analogously to endoglucanase assy, IUPAC, 1987). This sample was diluted (1 part per 2 
parts of buffer) by 100 mM buffers of sodium acetate and sodium phosphate, prepared » 
different pH values. The diluted samples were incubated at 40 -C fox 20 h, and the activity 
was assayed as described earner. The pH of incubation was measured after the incubaUon. 

More than 80 % of the original activity was observed in the samples incubated at pH range 
from ca. pH 5.4 to ca. pH 6.8. The relative recovered activity is presented in Ftgure 9. 



Example 10 

30 Hydrolysis of insoluble cellulosic substrates by EGV. 

Avicel (Serva 14204) which is mainly crystalline cellulose and phosphoric acid-swollen 
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amorphous (Walseth, 1952) cellulose were hydrolysed by the new endoglucanase enzyme 
(EGV) alone and in combinations with the previously known cellobiohydrolases of T. reesei. 
The preparation of EGV was obtained as described in Example 7. CBHI and CBHII were 
purified from culture filtrates of Trichoderma reesei and were pure proteins as judged by 
5 SDS-PAGE. As a control, the cellobiohydrolases were incubated without the addition of EGV. 
The reducing sugars liberated in the treatments were assayed using the DNS method and the 
reaction products were analysed by HPLC. 

The substrates for the hydrolysis were prepared in 50 mM sodium citrate buffer, pH 5.8 in . 

10 concentration of 10 g 1*'. EGV was dosed on the basis of activity against p-glucan at pH 5.8 
(500 or 2000 nkat g* 1 substrate) and cellobiohydrolases (CBHI and CBHII) on the basis of 
protein (1.0 or 4 mg g' 1 substrate). The reaction mixtures were incubated for 20 h at pH 5.8 at 
40 °C after which the hydrolysis was terminated by boiling. The values for reducing sugars as 
glucose assayed from the reaction mixture are presented in Tables 3 and 4. The enzyme 

15 dosage was 500 nkat g" 1 substrate for EGV and 1.0 g g' 1 substrate for CBHI and CBHII 

(Table 3), and 2000 nkat g' 1 substrate for EGV and 4.0 g g" 1 substrate for CBHI and CBHII 
(Table 4). Duration of the hydrolysis was 20 h in both cases. 

The major hydrolysis product of EGV was cellobiose but also cellotetraose was detected in 
20 the hydrolysate by HPLC. The strong synergy of EGV with CBHI in the hydrolysis of these 
substrates can be clearly seen. Even though EGV released only small amounts of soluble 
sugars the enhancing effect on the cellulose hydrolysis by CBHI was remarkable. 

25 Table 3. Reducing sugars liberated by cellulases from T. reesei in the hydrolysis of 



crystalline cellulose (Avicel) and amorphous (Walseth) cellulose 



enzymes 



reducing sugars as glucose (mg ml'' 



Avicel 



Walseth 



30 



EGV alone 



0.00 



0.01 (*) 



CBHI alone 



0.09 



0.16 
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CBHI1 alone 0-23 
CBHI and EGV 0.15 
CBHII and EGV 0-23 

(*) determined by HPLC 



10 Table 4. Reducing sugars liberated by celluloses from T. reese, in the hydrolysis of crystalli- 
ne cellulose (Avicel) and amorphous (Walseth) cellulose 



15 



20 



enzymes reducing sugars as glucose (mg ml"') 

Walseth 



EGV alone 


0.03 (*) 


0.05 


CBHI alone 


0.35 


0.49 


CBHII alone 


0.43 


1.44 


CBHI and EGV 


0.43 


0.71 


CBHII and EGV 


0.47 


1.67 


EGV (larger dosage 


0.06 


0.13 


(**) 







(*) determined by HPLC 
25 (**) dosed activity 5000 nkat g*' substrate 



Example 11 

30 Hydrolysis of soluble ceUulosic substrates by EGV 



35 



HEC (hydroxyethyl cellulose, Fluka 54290) which is a soluble substituted cellulose polymer 
and barley M»can (Megazyme. Australia) were hydrolysed by the new endoglucanase 
enzyme (EGV) alone and in combinations with two previously known endoglucanases of Z 
reesei The preparation of EGV was obtained as described in Example 7. Endoglucanases EGI 
and EGII were purified from culture filtrates of Trichoderma reesei and were pure protems as 
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judged by SDS-PAGE. As a control, the endoglucanases were incubated without the addition 
of EGV. The reducing sugars liberated in the treatments were assayed using the DNS method. 

The substrates for the hydrolysis were prepared in 50 mM sodium acetate buffer, pH 5.8 in 
5 concentration of 10 g I"'. Endoglucanases were dosed on the basis of activity against P-glucan 
(dosage for each: 100 nkat g* 1 substrate) The reaction mixtures were incubated at 40 °C after 
which the hydrolysis was terminated by boiling. The values for reducing sugars as glucose 
assayed from the reaction mixture for HEC are presented in Table 5, and for P-glucan in 
Table 6. The enhancing effect of EGV on the hydrolysis of HEC and especially p-glucan can 
10 clearly be seen. 

Table 5. Reducing sugars liberated by endoglucanases from T. reesei in the hydrolysis of 
hydroxyethyl cellulose. 



enzymes 


hydrolysis products as reducing sugars (mg ml" 1 ) 


2 h hydrolysis 


20 h hydrolysis 


EGV alone 


0.00 


0.00 


EGI alone 


0.14 


0.26 


EGII alone 


0.18 


0.29 


EGI and EGV 


0.16 


0.27 


EGII and EGV 


0.24 


0.29 
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Table 6. Reducing sugars liberated by endoglucanases from T. reesei in the hydrolysii 
barley p-glucan. Duration of hydrolysis 2 h. 



5 



enzymes 


hydrolysis products as 
reducing sugars (mg ml* 1 ) 


EGV alone 


0.96 


EGI alone 


1.1 


EGII alone 


0.90 


EGI and EGV 


2.4 


EGII and EGV 


2.0 



Example 12 

15 Modelling of the cellulose-binding domain of EGV 

Sequences of the five cellulose-binding domains (CBD) of the T. reesei celluloses were 
aligned to define conserved regions using the computer program MALIGN (Johnson et al, 
1993; Johnson and Overington, 1993), which is suitable for this purpose because the percenta- 

20 ge identity between the five CBDs is -60 %. The construction of a 3-D model of the EGV 

CBD was performed using the COMPOSER method (Sutcliffe et al, 1987a,b; Blundell et al., 
1988; Sali et al., 1990), which is based on rules derived from known three-dimensional 
structures. These rules can be used to define a conserved core for the model, to select appro- 
priate fragments for the variable regions and to replace the side chains. The NMR-based 

25 structure of the CBHI CBD (Kraulis et al., 1989) was used as a basis for the EGV model. 

The computer program CHARMm ver. 22 (Brooks et al, 1983) was used to soak the comple- 
ted model in a 35 A cubic box of water and to refine the model through energy irommization 
and molecular dynamics simulation of 100 ps under periodic boundary conditions. 

30 The sequence alignment shows that the CBDs of T. reesei are highly conserved except for one 
insertion and one deletion of a single aa in EGV. Therefore most parts of the 3-D structure of 
the CBHI CBD, determined by NMR (Kraulis et al, 1989), could be used as a conserved core 
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for modelling of the EGV CBD by the computer program COMPOSER. The CBHI CBD is a 
wedge-shaped domain having two flat surfaces. One of these is predominantly hydrophilic and 
contains three tyrosine residues that have been shown by chemical modification to be impor- 
tant for the binding of the enzyme to cellulose (Claeyssens and Tomme, 1989). Tyr 492 located 
at the tip of the wedge has also been demonstrated by siterdirected mutagenesis to be involved 
in substrate binding (Reinikainen et al % 1992). This residue is replaced by a tryptophan 
(Trp 136 ) in the EGV CBD (Fig 3B), an amino acid substitution also seen in many other fungal 
CBDs. Both tyrosine and tryptophan residues interact readily with carbohydrates. 

The backbones of the CBHI and EGV CBDs are very similar. Two disulfide bridges in 
identical positions stabilize the structures. The insertion and the deletion in EGV are situated 
in a single loop and thus compensate each other, maintaining the loop backbone unchanged 
compared with that of CBHI. However, there is an interesting difference in the backbone 
conformation at the other, more hydrophobic, flat face. A substantial change in torsion angle 
was observed at position Gly 220 of EGV, where the $-angle of the glycine residue changes 
from negative to positive during the refinement simulation. This causes the loop at region 
217-221 to be pushed outwards. Interestingly, the corresponding loop in the CBHI CBD 
contains a proline residue (Pro 4 * 4 ), mutation of which reduces the activity of CBHI against 
crystalline cellulose (Reinikainen et al. t 1992). Simulations of the other CBDs from T. reesei 
show that this region is the most flexible region of the CBD (A.-M. Hoffren, T.T. Teeri, and 
O. Teleman, submitted). 

There are also moderate differences in the backbone regions, in which proline (Pro 491 ) and 
serine (Ser 482 ) residues of the CBHI CBD are replaced by glutamine (Gin 235 ) and proline 
(Pro 227 ) residues in the EGV CBD, respectively. Moderate differences are also found in side 
chain conformations, but these changes are within the range of fluctuation occurring during 
simulation. One of the tyrosines (Tyr 210 ) forming the hydrophilic face of EGV CBD points 
more upwards than its counterpart (Tyr 4 * 6 ) in CBHI. This difference in orientation is signifi- 
cant but the flexibility allows Tyr 210 in EGV to occupy the same position as Tyr 466 in CBHI. 
Thus the difference in orientation is unlikely to affect substantially the affinity for cellulose. 
Overall, the EGV CBD seems to be less wedge-shaped and the hydrophobic surface more 
rounded than that of the CBHI CBD. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: OY ALXO AB 

(B) STREET: Salmisaarenranta 7 H 

(C) CITY: Helsinki 
10 (E) COUNTRY: Finland 

(F) POSTAL CODE: FIN-00180 

(ii) TITLE OF INVENTION: Novel Endoglucanase Enzyme 
15 (iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER READABLE FORM 

(A) MEDIUM TYPE: Floppy disc 

(B) COMPUTER: IBM PC compatible 

20 (C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Releae #1.0, Version #1.25 (EPO) 

(v) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: FI S32521 
25 (B) FILING DATE: 2-JUNE-1993 

(2) INFORMATION FOR SEQ ID NO: 1: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
40 (iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichoderma reesei 

(B) STRAIN: QM9414 



45 



50 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 40 . . 765 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GATCTTCCAT CTCGTGTCTT GCTTGTAACC ATCGTGACC ATG AAG GCA ACT CTG 54 

Met Lys Ala Thr Leu 

55 15 

GTT CTC GGC TCC CTC ATT GTA GGC GCC GTT TCC GCG TAC AAG GCC ACC 102 
Val Leu Gly Ser Leu lie Val Gly Ala Val Ser Ala Tyr Lys Ala Thr 
10 15 20 

60 

ACC ACG CGC TAC TAC GAT GGG CAG GAG GGT GCT TGC GGA TGC GGC TCG 150 
Thr Thr Arg Tyr Tyr Asp Gly Gin Glu Gly Ala Cys Gly Cys Gly Ser 
25 30 35 

65 AGC TCC GGC GCA TTC CCG TGG CAG CTC GGC ATC GGC AAC GGA GTC TAC 198 

Ser Ser Gly Ala Phe Pro Trp Gin Leu Gly lie Gly Asn Gly Val Tyr 
40 45 50 
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10 



15 



25 



30 



35 



45 



55 



60 



65 



ACG GCT GCC GGC TCC CAG GOT "CTC TTC GAC ACG GCC GGA GOT TCA TGG 
tS £2 S Gly Ser Gin Ala Leu Phe Asp Thr Ala Gly Ala Ser Trp 

55 €0 65 

TTC GGC GCC GGC TGC GOT AAA TGC TAC CAG CTC ACC TCG ACG GGC CAG 

S S ay *y *v* Qln ^ ™ r ser Thr Gly G 85 

70 7 5 80 

rrr CCC TGC TCC AGC TGC GGC ACG GGC GGT GCT GCT GGC CAG AGC ATC 
£a pro ?ys Ser £r Cys Gly Thr Gly Gly Ala Ala Gly Gin Ser lie 
90 95 

ATC GTC ATG GTG ACC AAC CTG TGC CCG AAC AAT GGG AAC GCG CAG TGG 
S 55 £2 ™ Thr Asn Leu Cys Pro Asn Asn Gly Asn Ala Gin Trp 
105 110 115 

TGC CCG GTG GTC GGC GGC ACC AAC CAA TAC GGC TAC AGC TAC CAT TTC 
Cys Pro Val Val Gly Gly Thr Asn Gin Tyr Gly Tyr Ser Tyr His Phe 



135 



TTT GAG CCC ATT GCT TGC CCC GGG CAG GCT GCC TCT GAC TGG GGG ACG 
p£ SS Pro lie Ala Cys Pro Gly Gin Ala Ala Ser Asp Trp Gly Thr 
ISO 155 160 165 

TGC CTC TGC GTG GGA CAG CAA GAG ACG GAT CCC ACG CCC GTC CTC GGC 
Leu Sys Val Gly Gin Gin Glu Thr Asp Pro Thr Pro Val Leu Gly 

170 I 75 
AAC GAC ACQ GGC TCA ACT CCT CCC GOG AGC TOG CCG CCA GOT ACA TCG 

Asn Asp Thr Gly Ser Thr Pro Pro Gly Ser Ser Pro Pro Ala Thr Ser 

IBS 190 19 

TCG AGT CCG CCG TCT GGC GGC GGC CAG CAG ACG CTC TAT GGC CAG TGT 

IS Ser Pro Pro Ser Gly Gly Gly Gin Gin Thr Leu Tyr Gly Gin Cye 

200 205 210 



215 



(2) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Ala Thr Leu Val Leu Gly Ser Leu lie Val Gly Ala Val Ser 



246 



294 



342 



390 



43B 



120 

20 rar arc ATG GCG CAG AAC GAG ATC TTT GGA GAC AAT GTC GTC GTC GAC 486 

lie £2 22 £m Su lie Phe Gly Asp Asn Val Val Val Asp 
* 140 145 



534 



582 



630 



678 



GGA GGT GCC GGC TGG ACG GGA CCT ACG ACG TGC CAG GCC CCA GGG ACC 726 
^^SlaOlyTrpThr Gly Pro Thr Thr Cys Gin Ala Pro Gly Thr 

220 225 



775 



TGC AAG GTT CAG AAC CAG TGG TAC TCC CAG TGT CTT CCT TGAGAAGGCC 
Cvs Lys Val Gin Asn Gin Trp Tyr Ser Gin Cys Leu Pro 
230 235 240 

CAAGATAGCC ATGTCTCTCT AGCATTCTTC CGGCGTCAGT CTGATCTGCC TATTTAATCA 835 
50 GGTCAGTCAA TATGTATCCA GAGATAATAA ATTATGTATA TT AT AG CAG 8B4 
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Ala Tyr Lys Ala Thr Thr Thr "Arg Tyr Tyr Asp Gly Gin Glu Gly Ala 
20 25 30 

Cys Gly Cys Gly Ser Ser Ser Gly Ala Phe Pro Trp Gin Leu Gly He 
5 35 40 45 

Gly Asn Gly Val Tyr Thr Ala Ala Gly Ser Gin Ala Leu Phe Asp Thr 
50 55 60 

10 Ala Gly Ala Ser Trp Cys Gly Ala Gly Cys Gly Lys -Cys Tyr Gin Leu 
65 70 75 80 



15 



30 



45 



55 



60 



Thr Ser Thr Gly Gin Ala Pro Cys Ser Ser Cys Gly Thr Gly Gly Ala 
85 90 95 

Ala Gly Gin Ser He He Val Met Val Thr Asn Leu Cys Pro Asn Asn 
100 105 110 



Gly Asn Ala Gin Trp Cys Pro Val Val Gly Gly Thr Asn Gin Tyr Gly 
20 115 120 125 

Tyr Ser Tyr His Phe Asp He Met Ala Gin Asn Glu He Phe Gly Asp 

130 135 140 

25 Asn Val Val Val Asp Phe Glu Pro He Ala Cys Pro Gly Gin Ala Ala 
145 150 15S 160 



Ser Asp Trp Gly Thr Cys Leu Cys Val Gly Gin Gin Glu Thr Asp Pro 
165 170 175 

Thr Pro Val Leu Gly Asn Asp Thr Gly Ser Thr Pro Pro Gly Ser Ser 
180 185 190 



Pro Pro Ala Thr Ser Ser Ser Pro Pro Ser Gly Gly Gly Gin Gin Thr 
35 195 200 205 

Leu Tyr Gly Gin Cys Gly Gly Ala Gly Trp Thr Gly Pro Thr Thr Cys 
210 215 220 

40 Gin Ala Pro Gly Thr Cys Lys Val Gin Asn Gin Trp Tyr Ser Gin Cys 
225 230 235 240 



Leu Pro 



(2) INFORMATION FOR SBQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 3: 

Tyr Lys Ala Thr Thr Thr Arg Tyr Tyr Asp Gly Gin Glu Gly Ala Cys 
1.5 10 15 

Gly Cys Gly Ser Ser Ser Gly Ala Phe Pro Trp Gin Leu Gly He Gly 
20 25 30 
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Asn Gly Val Tyr Thr Ala Ala "Gly Ser Gin Ala Leu Phe Asp Thr Ala 
35 40 45 

Gly Ala Ser Trp Cys Gly Ala Gly Cys Gly Lys Cys Tyr Gin Leu Thr 
50 55 60 

Ser Thr Gly Gin Ala Pro Cys Ser Ser Cys Gly Thr Gly Gly Ala Ala 
65 70 75 80 

Gly Gin Ser lie He Val Met Val Thr Asn Leu Cya Pro Asn Asn Gly 
85 90 95 

Asn Ala Gin Trp Cys Pro Val Val Gly Gly Thr Asn Gin Tyr Gly Tyr 
100 105 110 

Ser Tyr His Phe Asp He Met Ala Gin Asn Glu He Phe Gly Asp Asn 
115 120 125 

Val Val Val Asp Phe Glu Pro He Ala Cys Pro Gly Gin Ala Ala Ser 
20 130 135 140 

Asp Trp Gly Thr Cys Leu Cys Val Gly Gin Gin Glu Thr Asp Pro Thr 
145 150 155 160 

25 Pro Val Leu Gly Asn Asp 
165 



10 



15 



30 



(2) INFORMATION FOR SEQ ID NO: 4i 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GAATTCTCAC GGTGAATOTA GGCCTTTTGT AGGGTAGGAA TTGTCACTCA AGCACCCCCA 60 

40 

ACCTCCATTA CGCCTCCCCC ATAGAGTTCC CAATCAGTGA GTCATGGCAC TGTTCTCAAA 120 

TAGATTGGGG AGAAGTTGAC TTCCGCCCAfl AGCTGAAGGT CGCACAACCG CATGATATAG 180 

45 GGTCGGCAAC GGCAAAAAAG CACGTGGCTC ACCGAAAAGC AAGATGTTTG CGATCTAACA 240 

TCCAGGAACC TGGATACATC CATCATCACG CACGACCACT TTGATCTGCT GGTAAACTCG 300 

TATTCGCCCT AAACCGAAGT GCGTGGTAAA TCTACACGTG GGCCCCTTTC GGTATACTGC 360 

50 

GTGTGTCTTC TCTAGGTGCA TTCTTTCCTT CCTCTAGTGT TGAATTGTTT GTGTTGGGAG 420 

TCCGAGCTGT AACTACCTCT GAATCTCTGG AGAATGGTGG ACTAACGACT ACCGTGCACC 480 

55 TGCATCATGT ATATAATAGT GATCCTGAGA AGGGGGGTTT GGAGCAATGT GGGACTTTGA 540 

TGGTCATCAA ACAAAGAACG AAGACGCCTC TTTTGCAAAG TTTTGTTTCG GCTACGGTGA 600 

AGAACTGGAT ACTTGTTGTG TCTTCTGTGT ATTTTTGTGG CAACAAGAGG CCAGAGACAA 660 

60 

TCTATTCAAA CACCAAGCTT GCTCTTTTGA GCTACAAGAA CCTGTGGGGT ATATATCTAG 720 

AGTTGTGAAG TCGGTAATCC CGCTGTATAG TAATACGAGT CGCATCTAAA TACTCCGAAG 780 

65 CTGCTGCGAA CCCGGAGAAT CGAGATGTGC TGGAAAGCTT CTAGCGAGCG GCTAAATTAG 840 
CATGAAAGGC TATGAGAAAT TCTGGAGACG GCTTGTTGAA TCATGGCGTT CCATTCTTCG 



900 
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ACAAGCAAAG CGTTCCGTCG CAGTAGCAGG CACTCATTCC CGAAAAAACT CGGAGATTCC 960 

TAAGTAGCGA TGGAACCGGA ATAATATAAT AGGCAATACA TTGAGTTGCC TCGACGGTTG 1020 

5 CAATGCAGGG GTACTGAGCT TGGACATAAC TGTTCCGTAC CCCACCTCTT CTCAACCTTT 1080 

GGCGTTTCCC TGATTCAGCG TACCCGTACA AGTCGTAATC ACTATTAACC CAGACTGACC 1140 

GGACGTGTTT TGCCCTTCAT TTGGAGAAAT AATGTCATTG CGATGTGTAA TTTGCCTGCT 1200 

10 

TGACCGACTG GGGCTGTTCG AAGCCCGAAT GTAGGATTGT TATCCGAACT CTGCTCGTAG 1260 

AGGCATGTTG TGAATCTGTG TCGGGCAGGA CACGCCTCGA AGGTTCACGG CAAGGGAAAC 1320 

15 CACCGATAGC AGTGTCTAGT AGCAACCTGT AAAGCCGCAA TGCAGCATCA CTGGAAAATA 1380 

CAAACCAATG GCTAAAAGTA CATAAGTTAA TGCCTAAAGA AGTCATATAC CAGCGGCTAA 1440 

TAATTGTACA ATCAAGTGGC TAAACGTACC GTAATTTGCC AACGCGTTGT GGGGTTGCAG 1500 

20 

AAGCAACGGC AAAGCCCACT TCCCACGTTT GTTTCTTCAC TCAGTCCAAT CTCAGCTGGT 1560 

GATCCCCCAA TTGGGTCGCT TGTTTGTTCC GGTGAAGTGA AAGAAGACAG AGGTAAGAAT 1620 

25 GTCTGACTCG GAGCGTTTTG CATACAACCA AGGGCAGTGA TGGAAGACAG TGAAATGTTG 1680 

ACATTCAAGG AG TATTTAG C CAAGGGATG CTTGAGTGTATC GTGTAAGGAG GTTTGTCTGC 1740 

CGATACGACG AATACTGTAT AGTCACTTCT GATGAAGTGG TCCATATTGA AATGTAAGTC 1600 

30 

GGCACTGAAC AGGCAAAAGA TTGAGTTGAA ACTGCCTAAG ATCTCGGGCC CTCGGGCTTC 1860 

GGCTTTGGGT GTACATGTTT GTGCTCCGGG CAAATGCAAA GTGTGGTAGG ATCGACACAC 1920 

35 TGCTGCCTTT ACCAAGCAGC TGAGGGTATG TGATAGGCAA ATGTTCAGGG GCCACTGCAT 1980 

. GGTTTCGAAT AGAAAGAGAA GCTTAGCCAA GAACAATAGC CGATAAAGAT AGCCTCATTA 2040 

AACGAAATGA GCTAGTAGGC AAAGTCAGCG AATGTGTATA TATAAAGGTT CGAGGTCCGT 2100 

40 

GCCTCCCTCA TGCTCTCCCC ATCTACTCAT CAACTCAGAT CCTCCAGK3AG ACTTGTACAC 2160 

CATCTTTTGA GGCACAGAAA CCCAATAGTC AACCGCGGAC TGCGCATCAT G 2211 

45 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1627 base pairs 
50 (B) TYPE: nucleic acid 

<C> STRANDEBNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

55 

GGCGGTATTG GCTACAGCGG CCCCACOOTC TGCGCCAGCG GCACAACTTG CCAGGTCCTG 60 

AACCCTTACT ACTCTCAGTG CCTGTAAAGC TCCGTGCGAA AGCCTGACGC ACCGGTAGAT 120 

60 TCTTGGTGAG CCCGTATCAT GACGGCGGCG GGAGCTACAT GGCCCCGGGT GATTTATTTT 180 

TTTTGTATCT ACTTCTGACC CTTTTCAAAT ATACGGTCAA CTCATCTTTC ACTGGAGATG 240 

CGGCCTGCTT GGTATTGCGA TGTTGTCAGC TTGGCAAATT GTGGCTTTCG AAAACACAAA 300 

65 

ACGATTCCTT AGTAGCCATG CATCGGGATC CTTTAAGATA ACGGAATAGA AGAAAGAGGA 360 
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AATTAAAAAA AAAAA AAAA CAAACATCCC GTTCATAACC CGTAGAATCG CCGCTCTTCG . 

TGTATCCCAG TACCACGGCA AAGGTATTTC ATGATCGTTC AATGTTGATA TTGTTCCCGC 
5 CAGTATGGCT GCACCCCCAT CTCCGCGAAT CTCCTCTTCT CGAACGCGGT AGTGGCGCGC 

CAATTGGTAA TGACCATAGG GAGACAAACA GCATAATAGC AACAGTGGAA ATTAGTGGCG 

CAATAATTGA GAACACAGTG AGACCATAGC TGGCGGCCTG GAAAGCACTG TTGGAGACCA 
10 ACTTGTCCGT TGCGAGGCCA ACTTGCATTG CTGTCAAGAC GATGACAACG TAGCCGAGGA 

CCGTCACAAG GGACGCAAAG TTGTCGCGGA TGAGGTCTCC GTAGATGGCA TAGCCGGCAA 
15 TCCGAGAGTA GCCTCTCAAC AGGTGGCCTT TTCGAAACCG GTAAACCTTG TTCAGACGTC 

CTAGCCGCAG CTCACCGTAC CAGTATCGAG GATTGACGGC AGAATAGCAG TGGCTCTCCA 

GGATTTGACT GGACAAAATC TTCCAGTATT CCCAGGTcAC AGTGTCTGGC AGAAGTCCCT 
20 TCTCGCGTGC AHTCGAAAGT CGCTATAGTG CGCAATGAGA GCACAGTAGG AGAATAGGAA 

CCCGCGAGCA CATTGTTCAA TCTCCACATG AATTGGATGA CTGCTGGGCA GAATGTGCTG 
25 CCTCCAAAAT CCTGCGTCCA ACAGATACTC TGGCAGGGGC TTCAGATGAA TGCCTCTGGG 

CCCCCAGATA AGATGCAGCT CTGGATTCTC GQTTACNATG ATATCGCGAG AGAGCACGAG 

TTGGTGATGG AGGGACAGGA GGCATAGGTC GCGCAGGCCC ATAACCAGTC TTGCACAGCA 
30 TTGATCTTAC CTCACGAGGA GCTCCTGATG CAGAAACTCC TCCATGTTGC TGATTGGGTT 1320 

6AGAATTTCA TCGCTCCTGG ATCGTATGGT TGCTGGCAAO ACCCTGCTTA ACCGTGCCGT 
35 GTCATGGTCA TCTCTGGTGG CTTCGTCGCT GGCCTGTCTT TGCAATTCGA CAGCAAATGG 

TGGAGATCTC TCTATCGTGA CAGTCATGGT AGC6ATAGCT AGGTGTCGTT GCACGCACAT 

AGGCCGAAAT GCGAAGTGGA AAGAATTTCC CGGNTGCGGA ATGAAGTCTC GTCATTTTGT 
" ACTCGTACTC GACACCTCCA CCGAAGTGTT AATAATGGAT CCAOGATOCC AAAAAGCTTG 

TGCATGC 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 6: 

55 GAATTCTCAC GGTGAATGTA GGCCTTTTGT AGGGTAGGAA TTGTCACTCA AGCACCCCCA 

ACCTCCATTA CGCCTCCCCC ATAGAGTTCC CAATCAGTGA GTCATGGCAC TGTTCTCAAA 

60 TAGATTGGGG AGAAGTTGAC TTCCGCCCAG AGCTGAAGGT CGCACAACCG CATGATATAG 

GGTCGGCAAC GGCAAAAAAG CACGTGGCTC ACCGAAAAGC AAGATGTTTG CGATCTAACA 
TCCAGGAACC TGGATACATC CATCATCACG CACGACCACT TTGATCTGCT GGTAAACTCG 

" TATTCGCCCT AAACCGAAGT GCGTGGTAAA TCTACACGTG GGCCCCTTTC GGTATACTGC 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1380 
1440 
1500 
1560 
1620 
1627 



60 
120 
180 
240 
300 
360 
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GTGTGTCTTC TCTAGGTGCA TTCTTTCCTT CCTCTAGTGT TGAATTGTTT GTGTTGGGAG 420 

TCCGAGCTGT AACTACCTCT GAATCTCTGG AGAATGGTGG ACTAACGACT ACCGTGCACC 480 

TGCATCATGT ATATAATAGT GATCCTGAGA AGGGGGGTTT GGAGCAATGT GGGACTTTGA S40 

TGGTCATCAA ACAAAGAACG AAGACGCCTC TTTTGCAAAG TTTTGTTTCG GCTACGGTGA 600 

AGAACTGGAT ACTTGTTGTG TCTTCTGTGT ATTTTTGTGG CAACAAGAGG CCAGAGACAA 660 

TCTATTCAAA CACCAAGCTT GCTCTTTTGA GCTACAAGAA CCTGTGGGGT ATATATCTAG 720 

TGGCCAGAAT GCCTAGGTCA CCTCTAGAGA GTTGAAACTG CCTAAGATCT CGGGCCCTCG 780 

GGCTTCGGCT TTGGGTGTAC ATGTTTGTGC TCCGGGCAAA TGCAAAGTGT GGTAGGATCG 840 

ACACACTGCT GCCTTTACCA AGCAGCTGAG GGTATGTGAT AGGCAAATGT TCAGGGGCCA 900 

CTGCATGGTT TCGAATAGAA AGAGAAGCTT AGCCAAGAAC AATAGCCGAT AAAGATAGCC 960 

TCATTAAACG AAATGAGCTA GTAGGCAAAG TCAGCGAATG TGTATATATA AAGGTTCGAG 1020 

GTCCGTGCCT CCCTCATGCT CTCCCCATCT ACTCATCAAC TCAGATCCTC CAGGAGACTT 1080 

GTACACCATC TTTTGAGGCA CAGAAACCCA ATAGTCAACC GCGGACTGCG CATCATG 1137 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGTCTGAAGG ACGTGGAATG ATGG 24 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GATGCATCGA TCGTCCGCGG GTTGAGAGAA GTTGTTGGAT TGATCAAAAA G 51 



(2) INFORMATION FOR SEQ ID NO: 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GAGAGACCGC GGTGATCTTC CATCTCGTGT CTTGCTTGTA AC 42 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDHESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

10 

ATCGTGGATC CATTATTAAC ACTTCGGTGG 
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Claims: 

1 . An isolated DNA sequence, characterized in that it codes for a Trichoderma 
enzyme having endoglucanase activity, the molecular weight of the unglycosylated form of 

5 said enzyme being 20 to 25 kDa and said enzyme comprising a core domain, a linker region 
and a cellulose binding domain, and functional parts thereof. 

2. The DNA sequence according to claim 1, wherein the DNA sequence hybridizes to the 
DNA sequence of SEQ ID NO. 1 or to the DNA sequence of SEQ ID NO. 1 1. 

10 

3. The DNA sequence according to claim 1, wherein the DNA sequence codes for the amino 
acid sequence of SEQ ID NO. 2. 

4. The DNA sequence according to claim 1, wherein the DNA sequence is the DNA 
15 sequence of SEQ ID NO. 1 or to the DNA sequence of SEQ ID NO. 1 1 . 

5. A DNA sequence, which codes for a polypeptide having endoglucanase activity, said 
sequence coding for the sequence of SEQ ID NO, 3 or functional equivalents thereof. 

20 6. A vector construction, characterized in that it comprises the DNA sequence of 
any one of claims 1 to 5. 

7. A microorganism host, characterized in that it has been transformed with the 
DNA sequence of any one of claims 1 to 5 or with a vector construction of claim 6 and is 

25 able to express said DNA sequence. 

8. The host according to claim 7, wherein said host is a fungal or yeast host. 

9. The host according to claims 7 or 8, wherein said host is Trichoderma. 

30 

10. The host according to claims 7 or 8, wherein said host is Saccharomyces. 
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1 1. A culture medium, characterized in that is comprises the enzymes secreted 
from the host of claims 7 to 10. 

12. A product derived from the culture medium of claim 11 by purifying, concentrating, 
5 drying or immobilizing said culture medium. 

13. A method for isolating a DNA sequence coding for Trichoderma enzyme having endoglu- 

canase activity, 
characterized by 
10 - enriching the mRNA pool of a Trichoderma strain producing endoglucanase activity in 

respect of the mRNA of the endoglucanase by culruring the Trichoderma strain in 
conditions which will induce the endoglucanase production of said strain, 

- isolating mRNA from the strain, 

- preparing cDNA corresponding to the isolated mRNA, 

15 - placing the cDNA thus obtained in a vector under the control of a yeast promoter, 

- transforming the recombinant plasmids into a yeast strain which naturally does not 
produce the endoglucanase in order to provide an expression library, 

. cultivating the yeast clones thus obtained on a culture medium in order to express the 
expression library in the yeast, 
20 - separating the yeast clones producing the endoglucanase from the other yeast clones, 

- isolating the plasmid-DNA of said separated yeast clones, and, 

. if desired, sequencing the DNA in order to deterrnine the DNA sequence coding for the 
endoglucanase. 

25 14. The method according to claim 13, wherein the recombinant plasmids are transformed into 
a strain of the yeast Saccharomyces cerevisiae. 

15. The method according to claim 13, wherein the yeast clones are cultivated on a culture 
medium containing at least one substrate selected from the group comprising p-glucan, hydro- 

30 xyethyl cellulose, methylumbeiliferyl lactoside and methylumbelliferyl cellobioside. 

16. A method for constructing a Trichoderma host capable of expressing an endoglucanase 
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enzyme, characterized in that it comprises 

a) isolating the DNA sequence coding for an endoglucanase, the molecular weight of 
which in unglycosylated form is 20 to 25 kDa, or parts thereof, from a suitable donor 
strain, 

5 b) constructing a vector carrying said DNA sequence and 

c) transforming the vector obtained into a Trichoderma host. 

17. A method for producing a Trichoderma endoglucanase enzyme, characterized 
in that it comprises the steps of 
10 a) isolating the DNA sequence coding for an endoglucanase, the molecular weight of 

which in unglycosylated form is 20 to 25 kDa, or functional parts thereof, from a 

suitable donor strain 

b) constructing a vector carrying said DNA sequence, 

c) transforming the vector obtained to a Trichoderma host to obtain a recombinant host 
15 strain, 

d) cultivating said recombinant host strain under conditions permitting expression of 
said endoglucanase, and 

e) recovering said endoglucanase. 

20 18. A method for constracting a Saccharomyces host capable of expressing an endoglucanase 
enzyme, characterized in that it comprises 

a) isolating the DNA sequence coding for an endoglucanase, the molecular weight of 
whose unglycosylated form being 20 to 25 kDa, and functional parts thereof, from a 
suitable donor strain, 
25 b) constructing a vector carrying said DNA sequence and 

c) transforming the vector obtained into a Saccharomyces host 

19. A method for producing a Trichoderma endoglucanase enzyme, characterized 

by 

30 a) isolating the DNA sequence coding for an endoglucanase, the molecular weight of 

which is 20 to 25 kDa in unglycosylated form, or functional parts thereof, from a 
suitable donor strain 
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constructing a vector carrying said DNA sequence, 

transforming the vector obtained to a Saccharomyces host to obtain a recombinant 
host strain, 

cultivating said recombinant host strain under conditions permitting expression of 
said endoglucanase, and 
recovering said endoglucanase. 

20. An enzyme preparation, characterized in that it contains an endoglucanase 
enzyme having the amino acid sequence of SEQ ID NO. 2 or functional derivatives thereof. 

10 

21. An enzyme preparation, characterized inthatit contains elevated levels of an 
endoglucanase enzyme having in unglycosylated form a molecular weight in the range from 
20 to 25 kDa, or functional parts thereof, and exhibiting catalytic activity towards the 
substrates 0-glucan. 

15 

22. The enzyme preparation according to claim 22, wherein the endoglucanase enzymes 
exhibits activity towards crystalline cellulose substrate. 

23. A method for enzymatically modifying a cellulosic substrate, character- 

20 i z e d by contacting said substrate with an enzyme preparation according to any one of 
claims 20 to 22. 

24. The method according to claim 23, wherein the cellulosic substrate is fibrous. 
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b) 
c) 

d) 

e) 
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GATCTTCCATCTCGTGTCTTGCTTGTAACCATCGTGACCATGAAGGCAACTCTGGTTCTC fcO 

MetLy s Al aThr LeuVa lLeu 7 



GGCTCCCTCATTGTAGGCGCCGTTTCCGCGTACAAGGCCACCACCACGCGCTACTACGAT 11D 

GlySerLeuIleValGlyAlaValSerAla TyrLysAlaThrThrThrArgTyrTyrAsp 2* 

GGG C AGG AGGGTG CTTGCGG ATG CGG CTCG AGCTC CGGCG CATTC C CGTGG C AGCTCGG C iSO 
GlyGlnGluGlyAlaCysGlyCysGlySerSerSerGlyAlaPheProTrpGlnLeuGly 



ATCGGCAACGGAGTCTACACGGCTGCCGGCTCCCAGGCTCTCTTCGACACGGCCGGAGCT 2.HD 
IleGlyAenGlyValTyrThrAlaAlaGlySerGlnAlaLeuPheAspThrAlaGlyAla M 



TCATGGTGCGGCGCCGGCTGCGGTAAATGCTACCAGCTCACCTCGACGGGCCAGGCGCCC 3 DO 
SerTrpCysGlyAlaGlyCysGlyLysCysTyrGlnLeuThrSerThrGlyGlnAlaPro $? 

TGCTCCAGCTGCGGCACGGGCGGTGCTGCTGGCCAGAGCATCATCGTCATGGTGACCAAC 3U> 
CysSftrSerCysGlyThrGlyGlyAlaAlaGlyGlnSerllelleValMctValThrAsn 10? 



CTGTGCCCGAACAATGGGAACGCGCAGTGGTGCCCGGTGGTCGGCGGCACCAACCAATAC H2Q 
LeuCysProAsnAsnGlyAsnAlaGlnTrpCy sProValValGlyGlyThrAsnGlnTyr \ &± 



GGCTACAGCTACCATTTCGACATCATGGCGCAGAACGAGATCTTTGGAGACAATGTCGTC HtO 
GlyTyrSerTyrHisPheAspIleMetAlaGXnAcnGluIlePheGlyAspAsnValVal 1»f? 



GTCGACTTTGAGCCCATTGCTTGCCCCGGGCAGGCTGCCTCTGACTGGGGGACGTGCCTC 5H0 
ValAspPheGluProIleAlaCysProGlyGlnAlaAlaSerAspTrpGlyThrCysLeu 
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TAGCCATGTCTCTCTAGCATTCTTCCGGCGTCAGTCTGATCTGCCTATTTAATCAGGTCA tHO 
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